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ABSTRACT 


It  is  shown  that  the  implementation  oi  many  mathematical  programming  algorithms 
using  spreadsheet  software  can  be  described  as  instances  of  a  nonprocedural  approach  to 
computer  programming.  This  concept  is  generalized  to  an  abstract  computing  environment 
called  HYPERCELLS.  Its  significance  in  view  of  rapid  development  in  parallel  comp¬ 
utation  is  discussed.  1 


1.  INTRODUCTION 

V\  uh  the  widespread  use  of  microcomputers  in  academia,  business,  industry  and 
government,  spreadsheet  software  has  become  the  most  popular  general  purpose  quant¬ 
itative  tool  for  numerical  computation.  It  is  essentially  a  two  dimensional  array  of  cells. 

Each  cell  may  contain  text  for  documentation,  numerical  values  from  data,  or  a  formula 
dependent  on  values  in  other  cells.  Whenever  the  value  in  a  cell  is  changed,  the  content  of  Z' 
cells  with  formulas  depending  on  it  can  be  recalculated  automatically.  A  simple  cxample  is ' 
given  in  Figure  1.  Cells  Bl  through  B4  contain  quarterly  revenues.  Cell  B6  contains  a 
formula  for  the  sum  of  the  values  in  Bl  through  B4.  It  therefore  represents  the  total 
revenue  for  the  year.  When  the  quanerly  values  are  entered  or  altered,  the  total  is  updated 
accordingly. 

Most  commercial  spreadsheet  packages  provide  a  substantial  set  of  mathematical 
functions  with  which  complex  formulas  can  be  written  for  nontrivial  computational 
models.  Specifics  of  a  design  include  the  order  of  computation  and  the  ability  to  resolve 
circular  references.  For  this  reason,  while  the  initial  thrust  has  been  in  finance  and 
accounting  applications,  there  is  a  growing  interest  in  this  type  of  computing  environment 
in  science  and  engineering. 

In  this  paper,  we  demonstrate  and  generalize  certain  abstract  properties  of  modeling 
and  computing  in  the  spreadsheet  environment.  They  can  be  interpreted  as  a  nonpro¬ 
cedural  approach  to  the  implementation  of  numerical  methods  in  general  and  mathematical 
programming  in  particulau".  A  nonprocedural  approach  to  computer  programming  is  one 
that  relies  more  on  the  characterization  of  the  solution  than  on  the  description  of  every  step 
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required  in  the  computation.  The  distinction  from  conventional  coii^uter  programming  is 
shown  in  Section  2.  The  cases  of  dynamic  and  linear  programming  are  presented  in 
Sections  3  and  4  respectively.  Section  5  introduces  the  concept  of  Hypercells  as  a  general 
abstraction  of  the  spreadsheet  and  discusses  the  potential  of  parallel  processing  within  this 
firamework. 

The  purpose  of  this  paper  is  to  formalize  to  some  extent  the  departure  from 
conventional  implementation  of  mathematical  programming  afforded  by  dte  spreadsheeet 
environment.  Its  significance  draws  from  both  the  ever  increasing  popularity  of  the  latter 
as  well  as  the  potential  for  generalization.  At  present  it  is  difficult  to  project  future 
development  in  this  direction  and  hence  no  attempt  will  be  made  to  compare  the  absolute 
effectiveness  of  the  different  approaches. 
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Second  Quarter 

$145,000.00 
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Third  Quarter 
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Total  Revenue 

$565,000.00 

Figure  1.  Simple  Example  of  a  Spreadsheet  Model 
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Figure  2.  Nonprocedural  Implementation  of  Matrix  Multiplication 

2.  NONPROCEDURAL  IMPLEMENTATION  OF  NUMERICAL  METHODS 

To  illustrate  the  nonprocedural  environment  provided  by  spreadsheet  software,  we 
use  the  simple  example  of  matrix  multiplication.  In  Figure  2,  the  range  of  cells  from  A1  to 
D3  contains  a  3  by  4  matrix  A.  The  range  of  cells  from  FI  to  H4  contains  a  4  by  3  matrix 
B.  The  product  C  is  in  the  range  from  J1  to  L3. 
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In  conventional  computer  programming,  a  procedure  can  be  coded  to  compute  the 
coefficients  of  the  product  matrix.  Indeed,  by  looping  over  i  from  1  to  3  and  j  from  1  to  3, 
Cij  is  given  by  the  inner  product  of  row  i  in  A  and  Column  j  in  B.  The  formula  for  the 
inner  product  is  specified  once  with  generic  indices  i  and  j  and  executed  repeatedly  as  i  and 
j  take  on  different  values  over  the  appropriate  ranges. 

Consider  next  the  situation  on  a  spreadsheet  The  value  of  Cn  is  given  a  formula 
in  ceU  Jl:  +$A1*F$1+$B1*F$2+$C1*F$3+$D1*F$4.  The  "S"  sign  indicates  an  absolute 
address  and  its  absence  indicates  a  relative  address.  Since  all  commercial  spreadsheet 
packages  include  such  feature,  this  formula  can  be  copied  to  all  the  odier  cells  in  C  using 
typically  a  single  operation.  Each  cell  in  C  will  then  contain  a  formula  for  the  appropriate 
inner  product.  When  entries  in  A  or  B  are  altered,  the  values  in  C  will  be  recalculated.  Note 
that  ultimately,  the  values  in  C  arc  computed  "proccdurally"  in  some  well  defined  sequence 
specific  to  the  particular  spreadsheet  environment.  What  we  mean  here  by  a  nonprocedural 
approach  is  that  there  is  no  explicit  code  in  some  computer  language  for  the  implementation 
of  the  algorithm  for  matrix  multiplication.  Instead,  the  spreadsheet  in  Figure  2  contains  the 
essence  of  the  operation,  namely,  that  every  coefficient  in  the  result  is  the  inner  product  of 
a  row  and  a  column.  It  is  quite  obvious  that  this  nonprocedural  approach  implies 
rcdunduncy  and  requires  more  memory  than  conventional  programming.  However,  this 
rcdunduncy  can  be  exploited  in  parallel  computation  with  multiproccsscrs.  In  our  example, 
each  inner  product  for  the  coefficient  of  the  product  matrix  can  be  computed  independently. 

Using  the  same  principle,  many  numerical  methods  can  be  implemented  on  a 
spreadsheet.  Examples  of  significant  potential  include  finite  difference  methods  for  the 
solution  of  partial  differential  equations.  In  the  following,  we  focus  on  mathematical  pro¬ 
gramming  and  illustrate  the  nonprocedural  approach  for  both  dynamic  and  linear  pro¬ 
gramming. 

3.  DYNAMIC  PROGRAMMING 

Consider  the  simple  example  in  Figure  3.  There  are  two  stages.  Stage  2  has  two 
states;  C  and  D.  State  C  has  a  single  choice  with  resulting  sute  E  and  immediate  payoff  of 
3  units.  State  D  has  a  single  choice  with  resulting  state  E  and  immediate  payoff  of  2  units. 
Stage  one  has  two  states;  A  and  B.  State  A  has  two  choices;  one  with  resulting  state  C  and 
immediate  payoff  of  2  units;  the  other  with  resulting  state  D  and  immediate  payoff  of  S 
units.  State  B  has  two  choices:  one  with  resulting  state  C  and  immediate  payoff  of  3  units; 
the  other  with  resulting  state  D  and  immediate  payoff  of  4  units.  The  DP  problem  is  to  find 
an  optimal  policy  which  prescribes  a  sequence  of  choices  from  each  starting  state  in  Stage 
1  to  termination  that  will  maximize  the  total  payoff. 


3 


Stage  1  Stage  2 


Figure  3.  A  Simple  Example  in  Dynamic  Programming 
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Figure  4,  Spreadsheet  Implementation  of  a  Dynamic  Programming  Model 


The  method  of  recursion  is  assumed  well  known  and  will  not  be  described  here.  To 
implement  this  method  on  a  spreadsheet,  it  is  necessary  to  design  a  format  for  the  DP 
model.  Figure  4  represents  the  above  example  in  the  format  used  in  [Ho  1987].  There  is  a 
section  with  five  columns  for  each  stage.  The  top  part  of  such  a  section  contains  the 
optimal  policy  for  the  stage.  For  instance,  from  State  A  at  Stage  1,  the  opdinal  choice  is  to 
go  to  State  D  in  the  next  stage  with  an  optimal  value  of  7.  Whereas  from  State  B  there  are 
alternative  optimal  choices  with  a  value  of  6.  The  rest  of  the  section  contains  the  choices 
for  each  state.  A  decision  is  specified  by  the  resulting  state  at  the  next  stage.  The  payoff 
now  is  problem  data.  The  payoff  later  is  given  by  the  optimal  value  of  the  resulting  state. 
The  toul  payoff  is  the  sum  of  payoff  now  and  payoff  later.  The  optimal  value  for  the  state 
is  the  maximum  of  the  total  payoffs  among  the  choices.  An  asterisk  indicates  an  optimal 
choice.  The  optimal  policy  is  the  optimal  choice  if  unique,  otherwise  there  are  alternatives 
indicated  by  multiple  asterisk' 

The  above  relations  in  the  DP  model  are  constructed  recursively  from  the  last  stage 
backwards.  The  actual  formulas  may  depend  on  the  particular  spreadsheet  environment.  In 
any  case,  it  is  possible  to  implement  the  DP  method  nonprocedurally  so  that  when 
immediate  payoffs  are  changed,  the  solution  will  be  recomputed  automatically.  Dynamic 
programming  recursion  is  perhaps  the  simplest  and  most  natural  example  of  the 
nonprocedural  implementation  of  optimization  algorithms  on  spreadsheets. 


4.  LINEAR  PROGRAMMING 

Both  the  algebraic  formulation  of  linear  programs  and  the  simplex  method  [Dantzig 
1963]  have  natural  tabular  formats;  On  a  spreadsheet,  one  can  encode  the  definition  of  the 
simplex  tableau.  Using  a  considerably  more  intricate  and  elaborate  design  of  a  recursive  system 
than  that  for  DP,  the  tableau  can  be  updated  automatically  until  the  stopping  criterion  is  met.  In 
the  following,  we  will  describe  a  panicular  implementation  in  some  detail. 

Consider  the  following  linear  programming  problem. 
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Figure  5.  Initial  Tableau  for  LP  Example 
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Figure  6.  Final  Tableau  for  LP  Example 

Figure  5  illustrates  a  spreadsheet  implementation  of  the  simplex  tableau.  Initially,  this 
represents  the  original  problem  in  standard  form.  However,  the  formulas  involved  are  defined 
in  such  a  way  that  subsequent  recalculations  lead  eventually  to  the  final  tableau  shown  in 
Figure  6.  The  required  formulas  are  tabulated  below  in  generic,  pseudo-code  format  so  that  we 
are  not  confined  to  the  specifics  of  particular  software. 
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Cell  Range 

Conicpt 

Eoimula 

A1 

Mode 

0  to  initialize;  1  thereafter 

Cl 

Iteration  Count 

If  Mode=0  then  0,  else  Iteration  Count+l 

FI 

Solution  Status 

If  SumOfInfeasibilities>0  then 
if  pivot  then  "Phase  1” 
else  "Infeasible" 

else  if  MaxReducedCosc^  then  "Optimal" 
else  if  pivot  then  "I^ase  2" 
else  "Unbounded" 

C3..I3 

Pivot  Column  Selector 

If  SumOflnfeasibilities>0  then 

if  Phase  IReducedCost  of  Column  s 

MaxPhaselReducedCost  and 

MaxPhase  IReducedCost  >  0  then 

Value=l 

else  Va]ue=0 

else  if  Phasc2ReducedCost  of  Column  = 

MaxPhase2RcducedCost  and 

MaxPhase2RcducedCost  >  0  then 

Value=l 

else  Valuc=0 

J3 

Pivot  Column  Indicator 

Max(C3..I3) 

K3 

Pivot  Column  Index 

If  PivoiColuntnIndicaior=0  then 

Value="Nonc" 

else  Value=index  of  first  column  with  Pivot 

Column  Selector  of  1 

C4..J4 

Column  Indices 

j=l . 8 

C6..J6 

Pivot  Rov. 

A  vertical  look-up  of  the  pivot  row 

A9..A14 

Row  Indices 

i=  - 1  for  Phase  1  Objective 
i=  0  for  Phase  2  Objective 
i  =  1,...,4  for  constraints 

B9..B14 

Pivot  Column 

A  horizontal  look-up  of  the  pivot  column 

C9..F9 

Phase  1  Reduced  Costs 
for  Logical  Variables 

If  Mode>0  then  NewValue=Simplex  Pivot 
Update  of  OldValue  (using  the  pivot 
and  appropriate  entries  in  the  pivot  row 
and  pivot  column) 

else  if  Logical  Variable  is  in  an  "  ="  row 

7 


then  Valuc*0 


G9..19 


J9 

C10..F10 


G10..I10 


JIO 


Cn.D12 


Phase  1  Reduced  Costs 
for  Structural  Variables 


Sum  of  Infcasibilitics 
Phase  2  Reduced  Costs 
for  Logical  Variables 


Phase  2  Reduced  Costs 
for  Structural  Variables 


Phase  2  Objective  Value 


,F14  Diagonal  entries  in 
Tableau  for  Logical 
Variables 


else  Value=Partia]  Sum  of  column 
corresponding  to  pivoting  on 
artificials 

If  Mode>0  then  NewValue^implex  Pivot 
Update  of  OldValue 
else  Va]ue=Partial  Sum  of  column 

corresponding  to  pivoting  on 
artificials 
Same  as  above 
If  Mode>0  then 

if  Logical  Variable  is  in  an  ”  «"  row 
then  Valuc=0 

else  NewValue=Simplex  Pivot  Update 
of  OldValue 
else  Valuc=0 

If  Modc>0  the  NcwValuc=Simplex  Pivot 
Update  of  OldValue 

else  if  Minimization  then  Value®  -  Objective 
Coefficient  of  Structural  Variable 
else  Value*Objective  Coefficient  of 
Sffuctural  Variable 

If  Mode>0  then  NewValue=Simplex  Pivot 
Update  of  OldValue 
else  Value  =  0 
If  Modc>0  then 

if  Row  is  Pivot  Row  then  NewValue  = 
Old  Value/Pivot 

else  NewValue=Simplex  Pivot  Update 
of  OldValue 

else  if  RightHandSide^  then 

if  not  constraint  then  Value=l 
else  Value® -1 

else  if  constraint  then  Value®  -1 
else  Value®! 


Cl  1..F14  except  above 


Off-diagonal  entries  in 
Tableau  for  Logical 
Variables 


Gil. .114  Tableau  for  Structural 
Variables 


J11..J14  Right-Hand-Sidc  of 
Tableau 


M9  Maximum  Phase  1 

Reduced  Cost 

MiG  Maximum  Phase  2 

Reduced  Cost 


If  Mode>0  then 

if  Row  is  Pivot  Row  then  NewValue  ® 
OldValue/Pivot 

else  NewValuesSimplex  Pivot  Update 
of  Old  Value 
else  Valuc=0 
If  Mode>0  then 

if  Row  is  Pivot  Row  then  NewValue  = 
OldValue/Pivot 

else  NewValue=Simplex  Pivot  Update 
ofOldValue 

else  if  RightHandSide>0  tiien 

Value=Matrix  Coefficient 

else  Value®  -  Matrix  Coefficient 
If  Modc>0  then 

if  Row  is  Pivot  Row  then  NewValue  = 
OldValue/Pivot 

else  NewValue=Simplex  Pivot  Update 
of  Old  Value 

else  Value* Absolute  Value  of 

RightHandSide 
If  Max(C9..I9)>0  then 

Value*  Max(C9..I9) 
else  Value*0 
If  Max(C10..I10)>0  then 

Value*  Max(  CIO.. 1 10) 
else  Value=0 


K11..K14  Basis  Indices  IfMode>0then 

(Index  of  Column  if  row  is  Pivot  Row  then 

basic  in  Row  )  NewValue=Index  of  Pivot  Column 

else  NewValue=01dValue 
else  if  RightHandSide^O  and  constraint 
then  Value=Row  Index 
else  if  RightHandSidecO  and  const¬ 
raint  then  Value*Row  Index 
else  Value*  -Row  Index 


y 


L11..L14  Pivot  Ratios 


If  nonbinding  row  then  Valuesinfinity 
else  if  TablcauEntry>Pivot  Tolerance  and 
RightHandSide^  then  Value* 
RightHandSide  /TaUeauEntry 
else  if  TableauEntry<  Zero  Tdennoe 

and  basic  column  is  aitificia] 
and  RightHandSide<  Zero 
Tolerance  thenValue*0 
else  Value*Infinity 


M11..M14 

Min  Ratio  Indicator 

L17 

Min  Ratio 

M17 

Pivot  Row  Index 

M18 

Pivot 

If  Ratio=MinRatio  and  MinRatio<Infinity 
then  Value*Row  Index 
else  Value=0 
M1N(L11..L14) 

MAX(M11..M14) 

If  PivotRow=0  then  Value*"nonc" 
else  Valuc=Pivot  Coefficient  in  Tableau 


Note  that  the  above  spreadsheet  template  is  a  complete  implemenution  of  the 
two-phase  simplex  method  for  general  LP  problems  with  any  type  of  constraints  and 
right-hand-sides.  A  similar  approach  that  is  less  compact  and  less  general  first  appeared  in 
(Carroll  1986].  To  gain  this  compactness  in  the  present  design,  it  is  necessary  to 
recalculate  various  ranges  of  the  spreadsheet  separately.  This  can  be  automated  by  a  few 
lines  of  macro  instructions  of  the  following  form. 

While  PivotColumnlndex^”None‘'  and  PivotRowIndex#"None" 

Recalculate  (A1..M18)  {Update  Tableau) 

Recalculate  (C3..K3)  (Find  pivot  column) 

Recalculate  (B9..B14) 

Recalculate  (LI  1..L17)  (Find  pivot  row) 

Recalculate  (M9..M18) 

The  basic  difference  between  the  spreadsheet  implementation  and,  say,  a  con¬ 
ventional  Fortran  code  of  the  simplex  method,  or  one  coded  in  a  spreadsheet  macro 
language  (Ho  1987]  is  that  the  former  emphasizes  the  actual  definition  of  the  important 
concepts  whereas  the  latter  deals  with  their  translation  into  computational  procedures. 
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Essentially  the  spreadsheet  in  Figure  5  says  that  the  simplex  tableau  is  the  update  of  itself. 
The  definition  of  the  update  is  incorporated  directly  into  the  formulas  for  the  cells  holding 
ihu  tableau,  just  like  the  defintition  of  matrix  multiplication  is  expressed  in  our  earlier 
example.  The  process  of  constructing  the  entire  LP  spreadsheet  can  also  be  automated  with 
e.g.  macro  programs  that  define  all  the  appropriate  ranges  and  formulas  according  u>  given 
dimensions.  Functional  versions  of  such  software  using  Lotus  1-2-3  [Lotus  Development 
Corporation  1985]  are  used  in  this  work.  While  we  are  not  concerned  with  direct 
comparisons  of  computational  efficiency  at  this  stage,  it  is  clear  that  memory  usage  in  the 
nonprocedural  approach  is  going  to  be  high  because  of  the  replication  of  formulas. 
However,  such  redunduncy  may  be  exploited  eventually  in  parallel  computadons  as  we 
shall  discuss  in  some  generality  in  the  next  section.  In  terms  of  solution  speed,  it  is 
significantly  more  efficient  than  a  macro  program  for  the  simplex  method  [Ho  1987].  To 
date,  our  initial  e.xperience  has  been  with  small  textbook  problems  with  less  than  50 
constraints. 


5.  HYPERCELLS  AND  PARALLEL  COMPUTATION 

The  spreadsheet  belongs  to  a  new  paradigm  in  computing  environments  that  is  still 
in  its  infancy.  Our  effort  is  to  illustrate  its  implications  in  mathematical  programming  rather 
than  to  extol  its  actual  efficacy.  Critics  who  hasten  to  challenge  the  significance  of  this 
association  should  be  reminded  of  the  case  with  parallel  computing.  There  too,  one  often 
found  it  difficult  to  justify  radical  approaches  for  incremental  improvements.  However,  as 
multiprocessor  computers  become  prevalent,  a  constructive  rethinking  of  computational 
methodology  begins  to  take  shape.  Along  such  lines,  we  conclude  the  paper  with  a  vision 
of  the  nonprocedural  environment. 

Consider  the  n-dimensional  generalization  of  the  two-dimensional  spreadsheet.  It  is 
an  array  of  n-tuples  that  we  shall  call  hypercells.  Each  hypercell  may  contain  text,  a  value 
or  a  function  in  values  in  other  hypercclls.  Functions  are  defined  in  a  given  library  and 
should  include  all  the  usual  mathematical  and  logical  operations.  A  protocol  for  the 
recalculation  of  functional  values  is  also  given. 

As  an  example  of  the  possible  application  of  such  a  computing  environment,  we  can 
extend  the  above  optimization  models  and  have  a  third  dimension  representing  changes  in 
parameters  as  a  function  of  discrete  time  points.  The  nonprocedural  implementation  of 
linear  or  dynamic  programming  can  then  provide  time-phased  solutions  to  a  sequence  of 
related  problems.  At  this  writing,  the  forthcoming  version  of  a  popular  spreadsheet 
software  package  is  already  slated  to  be  three-dimensional. 

With  the  rapid  development  of  parallel  computer  architecture  (see  e.g.  [Fox  and 
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Messina  1987]),  it  will  also  become  attractive  to  distribute  the  tasks  of  recalculating  the 
hypercells  over  an  array  of  processors.  The  fact  that  formulas  are  explicitly  associated  with 
each  hypercell  in  the  nonprocedural  implementation  of  numerical  methods  makes  it 
especially  suitable  for  parallel  processing.  The  main  concern  with  programming  in  this 
environment  will  be  the  control  of  the  interdependencies  of  data  with  respect  to  the  given 
protocol  for  recalculation.  In  cases  where  most  of  the  hypercells  can  be  updated 
independently  and  concurrently,  like  most  entries  in  our  LP  tableau,  this  parallel 
nonprocedural  approach  should  be  very  efficient.  While  the  performance  of  single 
processors  are  bounded  by  physical  limits,  their  costs  arc  continuing  to  lower.  For  this 
reason,  a  multiprocessing  hypercell  computing  environment  should  become  viable  in  the 
foreseeable  future 
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